Two Experiments on Retrieval With Corrupted Data and Clean Queries in the TREC-4 Adhoc Task Environment: Data Fusion and Pattern Scanning
نویسندگان
چکیده
We report on several experiments in using data fusion to improve information retrieval, and in approximate text and 5-gram mathcing methods for retrieval of corrupted text, in the TREC context.
منابع مشابه
ANU/ACSys TREC-5 Experiments
A number of experiments conducted within the framework of the TREC-5 conference and using the Parallel Document Retrieval Engine (PADRE) are reported. Several of the experiments involve the use of distance-based relevance scoring (spans). This scoring method is shown to be capable of very good precision-recall performance, provided that good queries can be generated. Semi-automatic methods for ...
متن کاملQueries for High Precision and Recall ( MultiText Experiments for TREC - 7 )
The main aim of the MultiText experiments for TREC-7 was to derive very short queries that would yield high precision and recall, using a hybrid of manual and automatic processes. Identical queries were formulated for adhoc and VLC runs. A query set derived automatically from the topic title words, with an average of 2.84 terms per query, achieved a reasonable but unexceptional average precisio...
متن کاملDeriving Very Short Queries for High Precision and Recall (MultiText Experiments for TREC-7)
The main aim of the MultiText experiments for TREC-7 was to derive very short queries that would yield high precision and recall, using a hybrid of manual and automatic processes. Identical queries were formulated for adhoc and VLC runs. A query set derived automatically from the topic title words, with an average of 2.84 terms per query, achieved a reasonable but unexceptional average precisio...
متن کاملPassage-based query refinement (MultiText experiments for TREC-6)
The MultiText information retrieval system nds passages of text, as opposed to complete documents, that are likely relevant to a particular topic. Passage retrieval provides the basis for the relevance ranking, term expansion, interactive user interface, and distributed searching used in the MultiText experiments for TREC-6. The essence of relevance ranking is that shorter passages containing a...
متن کاملCluster-Based Relevance Feedback: Legal Track 2011
This is our second participation in the TREC Legal Track. The TREC Legal Track 2011 featured only the Learning Task. We participated in Topics 401 and 403. We used Lemur 4.11 for Boolean retrieval and followed it with a clustering technique, where we chose members from each cluster (which we called seeds) for relevance judgement by the TA and assumed all other members of the cluster whose seeds...
متن کامل